Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection
نویسندگان
چکیده
Future considerations would include ne tuning the latent SVM used to train the DPM, using stacked autoencoders to learn more complex feature representations, and optimize runtime of training algorithm to allow for larger training sets. Acknowledgements Special thanks to Professor Andrew Ng and Adam Coates for the advice they provided through the course of this project. Given its performance in recent years' PASCAL Visual Object Classes (VOC) Challenge, the Deformable Part Model (DPM) is widely regarded to be one of the state-of-the-art object detection and localization algorithms. The DPM as described by Felzenszwalb, et. al. uses Histogram of Oriented Gradients (HOG) descriptors as the underlying feature representation for an object. In this paper, we consider using features learned by a single-layered, sparse autoencoder as a substitute for HOG descriptors in the DPM. The rationale for this is that these learned feature descriptors may capture additional details present in the image that are not re ected in a human-designed set of features such as HOG. Using this more descriptive set of features may in turn yield a better object detector. Additionally, while it is true that deep learning algorithms such as the sparse autoencoder alone generally do not perform as well compared to other vision algorithms, it may be possible to integrate these learned feature descriptors into an existing image classi cation system such as the DPM. To further evaluate the performance of such an integration, we also consider the e ect of factors such as block normalization and colored features on the performance of our autoencoder-backed DPM. Finally, we examine the perfomance of the autoencoder-backed DPM across several di erent object classes from the PASCAL VOC Challenge.
منابع مشابه
Analysis and Synthesis of Facial Expressions by Feature-Points Tracking and Deformable Model
Face expression recognition is useful for designing new interactive devices offering the possibility of new ways for human to interact with computer systems. In this paper we develop a facial expressions analysis and synthesis system. The analysis part of the system is based on the facial features extracted from facial feature points (FFP) in frontal image sequences. Selected facial feature poi...
متن کاملFisher Discriminant Analysis (FDA), a supervised feature reduction method in seismic object detection
Automatic processes on seismic data using pattern recognition is one of the interesting fields in geophysical data interpretation. One part is the seismic object detection using different supervised classification methods that finally has an output as a probability cube. Object detection process starts with generating a pickset of two classes labeled as object and non-object and then selecting ...
متن کاملApplication of Combined Local Object Based Features and Cluster Fusion for the Behaviors Recognition and Detection of Abnormal Behaviors
In this paper, we propose a novel framework for behaviors recognition and detection of certain types of abnormal behaviors, capable of achieving high detection rates on a variety of real-life scenes. The new proposed approach here is a combination of the location based methods and the object based ones. First, a novel approach is formulated to use optical flow and binary motion video as the loc...
متن کاملOnline multiple people tracking-by-detection in crowded scenes
Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifie...
متن کاملPerformance evaluation of block-based copy- move image forgery detection algorithms
Copy-move forgery is a particular type of distortion where a part or portions of one image is/are copied to other parts of the same image. This type of manipulation is done to hide a particular part of the image or to copy one or more objects into the same image. There are several methods for detecting copy-move forgery, including block-based and key point-based methods. In this paper, a method...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010